Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis

Identifieur interne : 000D37 ( Main/Exploration ); précédent : 000D36; suivant : 000D38

Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis

Auteurs : Eveline Wandl-Vogt [Autriche]

Source :

RBID : Francis:09-0208472

Descripteurs français

English descriptors

Abstract

The DATABASE OF ELECTRONIC TEXTS of the LEXICON OF BAVARIAN DIALECTS IN AUSTRIA (WBÖ) (TEXTKORPUS zum WBÖ)-The Lexicon of Bavarian Dialects in Austria (Wörterbuch der bairischen Mundarten in Österrreich [WBÖ]) is based on a collection of about 4 million single records. They represent the variety of the regional, social and historical Bavarian dialects. About 10% of all entries are excerpts from texts of various types. The lexicographer has to draw the illustrative quotations from the original texts. The digitized full-texts have been dated and localized, thus each quotation can be placed in time and area enabling the lexicogapher to choose a representative number of examples. For the definitions, the most appropriate and illustrative quotations have to be found, in order to actively support the lexicographers work, the Institute of Lexicography of Austrian Dialects and Names / Institut für Österreichische Dialekt- und Namenlexika (http://www.oeaw.ac.at/dinamlex) started a new project: the so-called DBÖ (Database of the Bavarian Dialects in Austria / Datenbank der bairischen Mundarten in Österreich), financed by the Österreichische Akademie der Wissenschaften / Austrian Academy of Sciences. One main database includes the so-called Hauptkatalog, the main archive; there are additional databases which are necessary to retrieve correct information about the questionnaire, the localization of the word and the date of its recording. One component of the DBÖ is the database of electronic texts of the WBÖ, including some 90 Austrian texts spanning several centuries, representing the Austrian dialects. The original texts are scanned using the Austrian OCR-programme proLector V1.20 (A.1) which allows the training of various fonts. The machine-readable texts are converted in TUSTEP (Tübinger System von Textverarbeitungs-Programmen / Tübingen System of Text Processing Programmes). The TUSTEP-files get an alphanumeric key, which allows one to retrieve each quotation from the database in a chronological order and to sort it according to its localization. Finally the texts are broken down into the requisite sized pieces for quoting in the WBÖ-entries. Using a special programme the quotations can easily be reconnected and replaced in the proper context from which they were drawn. The digital texts are a valuable source for the lexicographer freeing him from the monotony of checking over and over again, thus leaving him more time for his proper work, namely the writing of entries for the WBÖ. Furthermore, the digital texts are important for speeding up the dictionary's publication in accordance with the guidelines of the Straffungskon-: zept 1993 and 1998.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="GER" level="a">Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis</title>
<author>
<name sortKey="Wandl Vogt, Eveline" sort="Wandl Vogt, Eveline" uniqKey="Wandl Vogt E" first="Eveline" last="Wandl-Vogt">Eveline Wandl-Vogt</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>Institut für Österreichische Dialekt-und Namenlexika, Österreichische Akademie der Wissenschaften, Wohllebengasse 12-14/2</s1>
<s2>1040 Wien</s2>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
<placeName>
<region type="land" nuts="2">Vienne (Autriche)</region>
<settlement type="city">Vienne (Autriche)</settlement>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">09-0208472</idno>
<date when="2008">2008</date>
<idno type="stanalyst">FRANCIS 09-0208472 INIST</idno>
<idno type="RBID">Francis:09-0208472</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000248</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000551</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000259</idno>
<idno type="wicri:doubleKey">0268-1145:2008:Wandl Vogt E:zitate:per:mausklick</idno>
<idno type="wicri:Area/Main/Merge">000D49</idno>
<idno type="wicri:Area/Main/Curation">000D37</idno>
<idno type="wicri:Area/Main/Exploration">000D37</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="GER" level="a">Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis</title>
<author>
<name sortKey="Wandl Vogt, Eveline" sort="Wandl Vogt, Eveline" uniqKey="Wandl Vogt E" first="Eveline" last="Wandl-Vogt">Eveline Wandl-Vogt</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>Institut für Österreichische Dialekt-und Namenlexika, Österreichische Akademie der Wissenschaften, Wohllebengasse 12-14/2</s1>
<s2>1040 Wien</s2>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
<placeName>
<region type="land" nuts="2">Vienne (Autriche)</region>
<settlement type="city">Vienne (Autriche)</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Literary and linguistic computing</title>
<title level="j" type="abbreviated">Lit. linguist. comput.</title>
<idno type="ISSN">0268-1145</idno>
<imprint>
<date when="2008">2008</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Literary and linguistic computing</title>
<title level="j" type="abbreviated">Lit. linguist. comput.</title>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Computational lexicography</term>
<term>Dialect</term>
<term>Textual database</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Lexicographie informatique</term>
<term>Base de données textuelles</term>
<term>Dialecte</term>
<term>Bavarois-Autrichien</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">The DATABASE OF ELECTRONIC TEXTS of the LEXICON OF BAVARIAN DIALECTS IN AUSTRIA (WBÖ) (TEXTKORPUS zum WBÖ)-The Lexicon of Bavarian Dialects in Austria (Wörterbuch der bairischen Mundarten in Österrreich [WBÖ]) is based on a collection of about 4 million single records. They represent the variety of the regional, social and historical Bavarian dialects. About 10% of all entries are excerpts from texts of various types. The lexicographer has to draw the illustrative quotations from the original texts. The digitized full-texts have been dated and localized, thus each quotation can be placed in time and area enabling the lexicogapher to choose a representative number of examples. For the definitions, the most appropriate and illustrative quotations have to be found, in order to actively support the lexicographers work, the Institute of Lexicography of Austrian Dialects and Names / Institut für Österreichische Dialekt- und Namenlexika (http://www.oeaw.ac.at/dinamlex) started a new project: the so-called DBÖ (Database of the Bavarian Dialects in Austria / Datenbank der bairischen Mundarten in Österreich), financed by the Österreichische Akademie der Wissenschaften / Austrian Academy of Sciences. One main database includes the so-called Hauptkatalog, the main archive; there are additional databases which are necessary to retrieve correct information about the questionnaire, the localization of the word and the date of its recording. One component of the DBÖ is the database of electronic texts of the WBÖ, including some 90 Austrian texts spanning several centuries, representing the Austrian dialects. The original texts are scanned using the Austrian OCR-programme proLector V1.20 (A.1) which allows the training of various fonts. The machine-readable texts are converted in TUSTEP (Tübinger System von Textverarbeitungs-Programmen / Tübingen System of Text Processing Programmes). The TUSTEP-files get an alphanumeric key, which allows one to retrieve each quotation from the database in a chronological order and to sort it according to its localization. Finally the texts are broken down into the requisite sized pieces for quoting in the WBÖ-entries. Using a special programme the quotations can easily be reconnected and replaced in the proper context from which they were drawn. The digital texts are a valuable source for the lexicographer freeing him from the monotony of checking over and over again, thus leaving him more time for his proper work, namely the writing of entries for the WBÖ. Furthermore, the digital texts are important for speeding up the dictionary's publication in accordance with the guidelines of the Straffungskon-: zept 1993 and 1998.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Autriche</li>
</country>
<region>
<li>Vienne (Autriche)</li>
</region>
<settlement>
<li>Vienne (Autriche)</li>
</settlement>
</list>
<tree>
<country name="Autriche">
<region name="Vienne (Autriche)">
<name sortKey="Wandl Vogt, Eveline" sort="Wandl Vogt, Eveline" uniqKey="Wandl Vogt E" first="Eveline" last="Wandl-Vogt">Eveline Wandl-Vogt</name>
</region>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000D37 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000D37 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Francis:09-0208472
   |texte=   Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024